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1 


<110> 


APPLICANT: KAWABATA 


, HIROSHI 


















2 




KOEFFLEER, H. PHILLIP 






















3 


<120> 


TITLE OF INVENTION: 


NUCLEIC 


ACIDS ENCODING TRANSFERRIN RECEPTOR 


4 




PROTEINS AND PRODUCTS RELATED THERETO 














5 


<13 0> 


FILE REFERENCE: 


8708/D7024 CEDERS-SINAI 


MEDICAL 


CENTER 






6 


<140> 


CURRENT 


APPLICATION 


NUMBER: 


US/09/358,755 












7 


<141> 


CURRENT 


FILING DATE 


: 1999-07-22 


















8 


<160> 


NUMBER OF SEQ ID NOS : 3 






















9 


<170> 


SOFTWARE: Patentln Ver. 


2 . U 




















10 


<210> 


SEQ 


ID NO 1 




























11 


<2 11> 


LENGTH : 


801 




























12 


<212> 


TYPE : PRT 




























13 


<213 > 


ORGANISM: human 


cells 






















14 


<400> 


SEQUENCE : 1 




























ID 




Met 


Glu 


Arg 


Leu 


Trp 


Gly 


Leu 




m n 

V7-L11 


Axg 


Ala 


d~\ n 




T .ei i 


Oar 




16 




1 








5 










t o 










15 




17 




Arg 


Ser 


Ser 


Gin 


Thr 


Val 


Tyr 


m n 


Airg 


Val 


vJXU 


fZl \7 

uiy 




Arg 


Lys 


Gly 


18 










20 










ZD 










30 






19 




His 


Leu 


Glu 


Glu 


Glu 


Glu 


Pin 


Asp 


m v 

w X y 


VJJ. Li 


VJl LI 


uiy 


Hid 


\J ±. u 


TVi t* 

J. ilJ- 


T.pn 

JJC Li 


20 








35 










4.0 










45 








21 




Ala 


His 


Phe 


Cys 


Pro 


Met 


Glu 


Leu 


Arg 




XT X. \J 


(Z~\ n 


13 ro 
CL. Lv 


T .011 
J-IC Li 


m v 


Car 


22 






50 










55 










60 










2 J 




Arg 


Pro 


Arg 


Gin 


Pro 


Asn 


Leu 


He 


Pro 


Trp 


Ala 


Ala 


Ala 


Gly 






24 




65 










70 










75 










80 


25 




Ala 


Ala 


Pro 


Tyr 


Leu 


Val 


Leu 


Thr 


Ala 


Leu 


Leu 


He 


Phe 


Thr 


tj±y 


A±a 


26 












85 










90 










95 




27 




Phe 


Leu 


Leu 


Gly 


Tyr 


Val 


Ala 


Phe 


Arg 


Gly 


Ser 


Cys 


Gin 


Ala 


Cys 


Gly 


28 










100 










105 










110 






29 




Asp 


Ser 


Val 


Leu 


Val 


Val 


Ser 


Glu 


Asp 


Val 


Asn 


Tyr 


Glu 


Pro 


Asp 


Leu 


30 








115 










120 










125 








31 




Asp 


Phe 


His 


Gin 


Gly 


Arg 


Leu 


Tyr 


Trp 


Ser 


Asp 


Leu 


Gin 


Ala 


Met 


Phe 


32 






130 










135 










140 










33 




Leu 


Gin 


Phe 


Leu 


Gly 


Glu 


Gly 


Arg 


Leu 


Glu 


Asp 


Thr 


He 


Arg 


Gin 


Thr 


34 




145 










150 










155 










160 


35 




Ser 


Leu 


Arg 


Glu 


Arg 


Val 


Ala 


Gly 


Ser 


Ala 


Gly 


Met 


Ala 


Ala 


Leu 


Thr 


36 












165 










170 










175 




37 




Gin 


Asp 


He 


Arg 


Ala 


Ala 


Leu 


Ser 


Arg 


Gin 


Lys 


Leu 


Asp 


His 


Val 


Trp 


38 










180 










185 










190 






39 




Thr 


Asp 


Thr 


His 


Tyr 


Val 


Gly 


Leu 


Gin 


Phe 


Pro 


Asp 


Pro 


Ala 


His 


Pro 


40 








195 










200 










205 








41 




Asn 


Thr 


Leu 


His 


Trp 


Val 


Asp 


Glu 


Ala 


Gly 


Lys 


Val 


Gly 


Glu 


Gin 


Leu 


42 






210 










215 










220 










43 




Pro 


Leu 


Glu 


Asp 


Pro 


Asp 


Val 


Tyr 


Cys 


Pro 


Tyr 


Ser 


Ala 


He 


Gly 


Asn 


44 




225 










230 










235 










240 
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45 
46 
47 
48 
49 
50 
51 
52 
53 
54 
55 
56 
57 
58 
59 
60 
61 
62 
63 
64 
65 
66 
67 
68 
69 
70 
71 
72 
73 
74 
75 
76 
77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
88 
89 
90 
91 
92 
93 
94 



Val Thr Gly Glu Leu Val Tyr Ala His Tyr Gly Arg Pro Glu Asp Leu 

245 250 255 

Gin Asp Leu Arg Ala Arg Gly Val Asp Pro Val Gly Arg Leu Leu Leu 

260 265 270 

Val Arg Val Gly Val lie Ser Phe Ala Gin Lys Val Thr Asn Ala Gin 

275 280 285 

Asp Phe Gly Ala Gin Gly Val Leu lie Tyr Pro Glu Pro Ala Asp Phe 

290 295 300 

Ser Gin Asp Pro Pro Lys Pro Ser Leu Ser Ser Gin Gin Ala Val Tyr 
305 310 315 320 

Gly His Val His Leu Gly Thr Gly Asp Pro Tyr Thr Pro Gly Phe Pro 

325 330 335 

Ser Phe Asn Gin Thr Gin Phe Pro Pro Val Ala Ser Ser Gly Leu Pro 

340 345 350 

Ser lie Pro Ala Gin Pro lie Ser Ala Asp lie Ala Ser Arg Leu Leu 

355 360 365 

Arg Lys Leu Lys Gly Pro Val Ala Pro Gin Glu Trp Gin Gly Ser Leu 

370 375 380 

Leu Gly Ser Pro Tyr His Leu Gly Pro Gly Pro Arg Leu Arg Leu Val 
385 390 395 400 

Val Asn Asn His Arg Thr Ser Thr Pro lie Asn Asn lie Phe Gly Cys 

405 410 415 

He Glu Gly Arg Ser Glu Pro Asp His Tyr Val Val He Gly Ala Gin 

420 425 430 

Arg Asp Ala Trp Gly Pro Gly Ala Ala Lys Ser Ala Val Gly Thr Ala 

435 440 445 

He Leu Leu Glu Leu Val Arg Thr Phe Ser Ser Met Val Ser Asn Gly 

450 455 460 

Phe Arg Pro Arg Arg Ser Leu Leu Phe He Ser Trp Asp Gly Gly Asp 
465 470 475 480 

Phe Gly Ser Val Gly Ser Thr Glu Trp Leu Glu Gly Tyr Leu Ser Val 

485 490 495 

Leu His Leu Lys Ala Val Val Tyr Val Ser Leu Asp Asn Ala Val Leu 

500 505 510 

Gly Asp Asp Lys Phe His Ala Lys Thr Ser Pro Leu Leu Thr Ser Leu 

515 520 525 

He Glu Ser Val Leu Lys Gin Val Asp Ser Pro Asn His Ser Gly Gin 

530 535 540 

Thr Leu Tyr Glu Gin Val Val Phe Thr Asn Pro Ser Trp Asp Ala Glu 
545 550 555 560 

Val He Arg Pro Leu Pro Met Asp Ser Ser Ala Tyr Ser Phe Thr Ala 

565 570 575 

Phe Val Gly Val Pro Ala Val Glu Phe Ser Phe Met Glu Asp Asp Gin 

580 585 590 

Ala Tyr Pro Phe Leu His Thr Lys Glu Asp Thr Tyr Glu Asn Leu His 

595 600 605 

Lys Val Leu Gin Gly Arg Leu Pro Ala Val Ala Gin Ala Val Ala Gin 

610 615 620 

Leu Ala Gly Gin Leu Leu He Arg Leu Ser His Asp Arg Leu Leu Pro 
625 630 635 640 



# 
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QC 


T.oi i 




It lie 




ni y j. y x 






val 


V CLX 


XIC u. 


Arg 


XIX 0 


Tie 

X xc 




Asn 


96 










645 








650 










655 




Q7 


T .oi i 
LcU 


7\ en 




It 11C 


Gay ftl "1 \/ 

ocx o±y 


nay* 


T.oii 


uy o 


Al a 


Arg 


vjx y 


T .oil 
ucu 


X 111. 


XIC LI 


Oln 

VJ7 X 11 


QO 
-7 O 








o o u 








o o j 










670 






q q 


irp 


va x 


ryr 


Oar 
DCl 


nla nlvj 


m V 
oiy 


A en 


iyr 


Tip 
X xc 




al a 

raXci 


Ala 

nla 


will 


xi y o 


T .on 

XIC u 


i no 














con 

O O \J 










O O 








i n i 


Arg 


tain 


blU 


lie 


iyr ocx 


Ser 


nl ii 




A ■yrr 

Arg 


A en 
nop 


rcl ii 


Arg 


T .01 1 


1 11X 


Arg 


102 
X v ^ 




690 








695 










700 










103 


Met 


Tyr 


Asn 


vai 


Arg lie 


Met 


Arg 


vai 


U1U 


pne 


Tyr 


pne 


Leu 


Ser 


bin 


104 


705 








710 










715 










720 


105 


Tyr 


Val 


Ser 


Pro 


Ala Asp 


Ser 


Pro 


Phe 


Arg 


His 


He 


Phe 


Met 


Gly 


Arg 


106 










725 








730 










735 




107 


Gly 


Asp 


His 


Thr 


Leu Gly 


Ala 


Leu 


Leu 


Asp 


His 


Leu 


Arg 


Leu 


Leu 


Arg 


108 








740 








745 










750 






109 


Ser 


Asn 


Ser 


Ser 


Gly Thr 


Pro 


Gly 


Ala 


Thr 


Ser 


Ser 


Thr 


Gly 


Phe 


Gin 


110 






755 








760 










765 








111 


Glu 


Ser 


Arg 


Phe 


Arg Arg 


Gin 


Leu 


Ala 


Leu 


Leu 


Thr 


Trp 


Thr 


Leu 


Gin 


112 




770 








775 










780 










113 


Gly 


Ala 


Ala 


Asn 


Ala Leu 


Ser 


Gly 


Asp 


Val 


Trp 


Asn 


He 


Asp 


Asn 


Asn 


114 


785 








790 










795 










800 



115 Phe 

116 <210> SEQ ID NO 2 

117 <211> LENGTH: 2877 

118 <212> TYPE: DNA 

119 <213> ORGANISM: human genome 

12 0 <400> SEQUENCE: 2 

121 ctgeaggett caggagggga cacaagcatg gageggcttt ggggtctatt ecagagageg 60 

122 caacaactgt ccccaagatc ctctcagacc gtctaccagc gtgtggaagg cccccggaaa 12 0 

123 gggcacctgg aggaggaaga ggaagacggg gaggaggggg eggagacatt ggcccacttc 180 

124 tgccccatgg agctgagggg ccctgagccc ctgggctcta gacccaggca gccaaacctc 240 

125 attccctggg eggcagcagg aeggaggget gccccctacc tggtcctgac ggccctgctg 300 

126 atcttcactg gggccttcct actgggctac gtcgccttcc gagggtcctg ccaggcgtgc 360 

127 ggagactctg tgttggtggt cagtgaggat gtcaactatg agcctgacct ggatttccac 420 

128 cagggcagac tctactggag cgacctccag gccatgttcc tgcagttcct gggggagggg 480 

129 cgcctggagg acaccatcag gcaaaccagc ettegggaac gggtggcagg ctcggccggg 540 

13 0 atggccgctc tgactcagga cattcgcgcg gcgctctccc gecagaaget ggaccacgtg 600 

131 tggaccgaca cgcactacgt ggggctgcaa ttcccggatc cggctcaccc caacaccctg 660 

132 cactgggtcg atgaggcegg gaaggtcgga gagcagctgc cgctggagga ccctgacgtc 720 

133 tactgcccct acagcgccat cggcaacgtc aegggagage tggtgtacgc ccactacggg 780 

134 cggcccgaag acctgeagga cctgcgggcc aggggcgtgg atccagtggg ccgcctgctg 840 

135 ctggtgcgcg tgggggtgat cagcttcgcc cagaaggtga ccaatgctca ggacttcggg 900 
13 6 gctcaaggag tgctcatata cccagagcca geggacttet cccaggaccc acccaagcca 960 

137 agcctgtcca gccagcaggc agtgtatgga catgtgcacc tgggaactgg agacccctac 102 0 

138 acacctggct tcccttcctt caatcaaacc cagttccctc cagttgeate ateaggcett 1080 
13 9 cccagcatcc cagcccagcc catcagtgea gaeattgect cccgcctgct gaggaagctc 1140 

140 aaaggccctg tggcccccca agaatggcag gggagcctcc taggctcccc ttatcacctg 1200 

141 ggccccgggc cacgactgcg gctagtggtc aacaatcaca ggacctccac ccccatcaac 1260 

142 aacatcttcg getgeatega aggccgctca gagecagate actacgttgt catcggggcc 1320 

143 cagagggatg catggggccc aggagcagct aaatccgctg tggggaegge tatactcctg 1380 

144 gagctggtgc ggaccttttc ctccatggtg ageaaegget tccggccccg cagaagtctc 1440 



i 
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145 ctcttcatca gctgggacgg tggtgacttt ggaagcgtgg gctccacgga gtggctagaa 1500 

146 ggctacctca gcgtgctgca cctcaaagcc gtagtgtacg tgagcctgga caacgcagtg 1560 

147 ctgggggatg acaagtttca tgccaagacc agcccccttc tgacaagtct cattgagagt 162 0 

148 gtcctgaagc aggtggattc tcccaaccac agtgggcaga ctctctatga acaggtggtg 1680 

149 ttcaccaatc ccagctggga tgctgaggtg atccggcccc tacccatgga cagcagtgcc 174 0 

150 tattccttca cggcctttgt gggagtccct gccgtcgagt tctcctttat ggaggacgac 1800 

151 caggcctacc cattcctgca cacaaaggag gacacttatg agaacctgca taaggtgctg 1860 

152 caaggccgcc tgcccgccgt ggcccaggcc gtggcccagc tcgcagggca gctcctcatc 1920 

153 cggctcagcc acgatcgcct gctgcccctc gacttcggcc gctacgggga cgtcgtcctc 1980 

154 aggcacatcg ggaacctcaa cgagttctct ggggacctca aggcccgcgg gctgaccctg 2040 

155 cagtgggtgt actcggcgcg gggggactac atccgggcgg cggaaaagct gcggcaggag 2100 

156 atctacagct cggaggagag agacgagcga ctgacacgca tgtacaacgt gcgcataatg 2160 

157 cgggtggagt tctacttcct ttcccagtac gtgtcgccag ccgactcccc gttccgccac 2220 

158 atcttcatgg gccgtggaga ccacacgctg ggcgccctgc tggaccacct gcggctgctg 2280 

159 cgctccaaca gctccgggac ccccggggcc acctcctcca ctggcttcca ggagagccgt 2340 

160 ttccggcgtc agctagccct gctcacctgg acgctgcaag gggcagccaa tgcgcttagc 2400 

161 ggggatgtct ggaacattga taacaacttc tgaggccctg gggatcctca catccccgtc 2460 

162 ccccagtcaa gagctcctct gctcctcgct tgaatgattc agggtcaggg aggtggctca 252 0 

163 gagtccacct ctcattgctg atcaatttct cattacccct acacatctct ccacggagcc 2580 

164 cagaccccag cacagatatc cacacacccc agccctgcag tgtagctgac cctaatgtga 2640 

165 cggtcatact gtcggttaat cagagagtag catcccttca atcacagccc cttccccttt 2700 

166 ctggggtcct ccatacctag agaccactct gggaggtttg ctaagccctg ggacctggcc 2760 

167 agctctgtta gtgggagaga tcgctggcac catagcctta tggccaacag gtggtctgtg 2820 

168 gtgaaagggg cgtggagttt caatatcaat aaaccacctg atatcaataa gccaaaa 2877 

169 <210> SEQ ID NO 3 

170 <211> LENGTH: 2519 

171 <212> TYPE: DNA 

172 <213> ORGANISM: human genome 

173 <400> SEQUENCE: 3 

174 gcgtccgcgg ggagcgctct tttcctaaac tcaggaaccc ctcgccgccc ctgcccctgg 60 

175 cgaccccacg tctctggcat ccttccctct tccctccctc tcctccgggc gcccaaaaaa 120 

176 gtccccacct ctccccgctt aggcaaacca gccttcggga acgggtggca ggctcggccg 180 

177 ggatggccgc tctgactcag gacattcgcg cggcgctctc ccgccagaag ctggaccacg 240 

178 tgtggaccga cacgcactac gtggggctgc aattcccgga tccggctcac cccaacaccc 3 00 

179 tgcactgggt cgatgaggcc gggaaggtcg gagagcagct gccgctggag gaccctgacg 360 

180 tctactgccc ctacagcgcc atcggcaacg tcacgggaga gctggtgtac gcccactacg 420 

181 ggcggcccga agacctgcag gacctgcggg ccaggggcgt ggatccagtg ggccgcctgc 480 

182 tgctggtgcg cgtgggggtg atcagcttcg cccagaaggt gaccaatgct caggacttcg 540 

183 gggctcaagg agtgctcata tacccagagc cagcggactt ctcccaggac ccacccaagc 600 

184 caagcctgtc cagccagcag gcagtgtatg gacatgtgca cctgggaact ggagacccct 660 

185 acacacctgg cttcccttcc ttcaatcaaa cccagttccc tccagttgca tcatcaggcc 720 

186 ttcccagcat cccagcccag cccatcagtg cagacattgc ctcccgcctg ctgaggaagc 780 

187 tcaaaggccc tgtggccccc caagaatggc aggggagcct cctaggctcc ccttatcacc 840 

188 tgggccccgg gccacgactg cggctagtgg tcaacaatca caggacctcc acccccatca 900 

189 acaacatctt cggctgcatc gaaggccgct cagagccaga tcactacgtt gtcatcgggg 960 

190 cccagaggga tgcatgggcc ccaggagcag ctaaatccgc tgtggggacg gctatactcc 102 0 

191 tggagctggt gcggaccttt tcctccatgg tgagcaacgg cttccggccc cgcagaagtc 1080 

192 tcctcttcat cagctgggac ggtggtgact ttggaagcgt gggctccacg gagtggctag 1140 

193 aaggctacct cagcgtgctg cacctcaaag ccgtagtgta cgtgagcctg gacaacgcag 1200 

194 tgctggggga tgacaagttt catgccaaga ccagccccct tctgacaagt ctcattgaga 1260 
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195 gtgtcctgaa gcaggtggat tctcccaacc acagtgggca gactctctat gaacaggtgg 132 0 

196 tgttcaccaa tcccagctgg gatgctgagg tgatccggcc cctacccatg gacagcagtg 1380 

197 cctattcctt cacggccttt gtgggagtcc ctgccgtcga gttctccttt atggaggacg 1440 

198 accaggccta cccattcctg cacacaaagg aggacactta tgagaacctg cataaggtgc 1500 

199 tgcaaggccg cctgcccgcc gtggcccagg ccgtggccca gctcgcaggg cagctcctca 1560 

200 tccggctcag ccacgatcgc ctgctgcccc tcgacttcgg ccgctacggg gacgtcgtcc 1620 

201 tcaggcacat cgggaacctc aacgagttct ctggggacct caaggcccgc gggctgaccc 1680 

202 tgcagtgggt gtactcggcg cggggggact acatccgggc ggcggaaaag ctgcggcagg 1740 

203 agatctacag ctcggaggag agagacgagc gactgacacg catgtacaac gtgcgcataa 1800 

204 tgcgggtgga gttctacttc ctttcccagt acgtgtcgcc agccgactcc ccgttccgcc 1860 

205 acatcttcat gggccgtgga gaccacacgc tgggcgccct gctggaccac ctgcggctgc 1920 

206 tgcgctccaa cagctccggg acccccgggg ccacctcctc cactggcttc caggagagcc 1980 

207 gtttccggcg tcagctagcc ctgctcacct ggacgctgca aggggcagcc aatgcgctta 2040 

208 gcggggatgt ctggaacatt gataacaact tctgaggccc tggggatcct cacatccccg 2100 

209 tcccccagtc aagagctcct ctgctcctcg cttgaatgat tcagggtcag ggaggtggct 2160 

210 cagagtccac ctctcattgc tgatcaattt ctcattaccc ctacacatct ctccacggag 2220 

211 cccagacccc agcacagata tccacacacc ccagccctgc agtgtagctg accctaatgt 2280 

212 gacggtcata ctgtcggtta atcagagagt agcatccctt caatcacagc cccttcccct 2340 

213 ttctggggtc ctccatacct agagaccact ctgggaggtt tgctaggccc tgggacctgg 2400 

214 ccagctctgt tagtgggaga gatcgctggc accatagcct tatggccaac aggtggtctg 246 0 

215 tggtgaaagg ggcgtggagt ttcaatatca ataaaccacc tgatatcaat aagccaaaa 2519 
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